Abstract

We consider a model of asexually reproducing individuals with random mutations and
selection. The rate of mutations is proportional to the population size, $N$. The mutations
may be either beneficial or deleterious. In a paper by Yu, Etheridge and Cuthbertson (2009)
it was conjectured that the average rate at which the mean fitness increases in this model
is $O(\log N/(\log\log N)^2)$. In this paper we show that for any time $t > 0$ there exist values
$\epsilon_N \to 0$ and a fixed $c > 0$ such that the maximum fitness of the population is greater than
$cs\log N/(\log\log N)^2$ for all times $s \in [\epsilon_N, t]$ with probability tending to 1 as $N$ tends to
infinity.


1 Introduction 

In this paper we consider an evolutionary model of a population of individuals that was first
introduced by Yu, Etheridge and Cuthbertson [7]. The model has parameters $N$, $q$, $\mu$ and $\gamma$. The
population size is a fixed integer $N > 0$. The rate at which each individual gets mutations is
$\mu > 0$ and each mutation is beneficial with probability $q$, where $0 < q < 1$, and deleterious with
probability $1 - q$. Each beneficial mutation increases an individual's selective advantage by $\gamma > 0$.
To make the description precise, the stochastic process we are interested in is

$$\{X_t : t \ge 0\} \quad \text{where} \quad X_t = (X_t^1, X_t^2, \ldots, X_t^N) \in \mathbb{Z}^N$$

is an $N$-dimensional vector for $t \ge 0$. Each coordinate represents the fitness of an individual
in the population at time $t$. For all coordinates $i, j \in \{1, 2, \ldots, N\}$ the model has the following
transition rates:

$$\begin{aligned}
X^i &\to X^i + 1 && \text{at rate } q\mu,\\
X^i &\to X^i - 1 && \text{at rate } (1 - q)\mu,\\
X^i &\to X^j && \text{at rate } \frac{1}{N},\\
X^i &\to X^j && \text{at rate } \frac{\gamma}{N}(X^j - X^i) \vee 0.
\end{aligned}$$

The first two transition rates correspond to beneficial and deleterious mutations respectively. The
rate $1/N$ at which $X^i \to X^j$ is the resampling mechanism. At rate one an individual is chosen
to give birth uniformly at random and an individual is chosen to die uniformly at random. The
rate $\gamma(X^j - X^i)/N \vee 0$ at which $X^i \to X^j$ is the selection mechanism. Mutation, resampling and
selection events occur independently of one another.
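
The four transition rates can be simulated directly with a Gillespie-style event loop. The following is a minimal sketch and is not taken from the paper: the function name `simulate_fitness`, the all-zero initial condition and the brute-force recomputation of the selection rates at every step are illustrative assumptions, chosen for readability rather than speed.

```python
import random


def simulate_fitness(N, q, mu, gamma, t_end, seed=0):
    """Run the mutation/resampling/selection dynamics up to time t_end.

    Illustrative sketch only: starts from X_0 = (0, ..., 0), which is an
    assumption, and returns the fitness vector at time t_end.
    """
    rng = random.Random(seed)
    x = [0] * N
    t = 0.0
    while True:
        # Selection: coordinate i adopts X^j at rate gamma * (X^j - X^i)^+ / N.
        pairs = [(i, j) for i in range(N) for j in range(N) if x[j] > x[i]]
        gaps = [x[j] - x[i] for i, j in pairs]
        rates = (
            q * mu * N,             # beneficial mutations, rate q*mu per individual
            (1 - q) * mu * N,       # deleterious mutations
            float(N),               # resampling: N^2 ordered pairs, each at rate 1/N
            gamma * sum(gaps) / N,  # selection, summed over all ordered pairs
        )
        total = sum(rates)
        t += rng.expovariate(total)  # waiting time until the next event
        if t > t_end:
            return x
        u = rng.random() * total
        if u < rates[0]:
            x[rng.randrange(N)] += 1
        elif u < rates[0] + rates[1]:
            x[rng.randrange(N)] -= 1
        elif u < rates[0] + rates[1] + rates[2] or not pairs:
            i, j = rng.randrange(N), rng.randrange(N)
            x[i] = x[j]
        else:
            # Pick the ordered pair (i, j) with probability proportional to its gap.
            i, j = rng.choices(pairs, weights=gaps)[0]
            x[i] = x[j]
```

For example, `simulate_fitness(N=50, q=0.5, mu=1.0, gamma=0.1, t_end=5.0)` returns a list of 50 integer fitness values; its maximum corresponds to the maximum fitness of the population at time 5 under these assumed parameters.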

With the vector valued process $X$ in mind we can think of $X^i$ as the fitness of individual $i$.
That is, if individual 1 dies and individual 2 gives birth, rather than thinking of the population as
having lost one individual and gained a new individual, we think of individual 1 as adopting the
fitness of individual 2. From the perspective of $X$ as a vector valued process, to say individual $i$
has died and individual $j$ has given birth we only mean that the value of $X^i$ has been changed to
the value of $X^j$. This way, we can continue referring to the individual whose fitness is recorded
in coordinate $i$ as the $i$th individual even though from the biological perspective the individual
whose fitness is in the $i$th coordinate will change over time due to death and birth events.

It is important to note that with the vector valued perspective in mind, when we say individual
$i$ dies due to resampling we mean that the value in coordinate $i$ has changed to the value of a
randomly chosen coordinate. By the dynamics of the model, such events occur in each coordinate at
rate 1. Each individual can only decrease in fitness when it gets a deleterious mutation or dies due
to a resampling event (although death due to a resampling event does not necessarily mean that
the fitness of the individual will decrease). The rate at which events occur that may cause an
individual's fitness to decrease is

$$d = (1 - q)\mu + 1.$$

The value d is the same for each individual and does not depend on the population size N. 


1.1 Previous Results 


Let

$$\bar{X}_t = \frac{1}{N}\sum_{i=1}^{N} X_t^i$$

be the mean fitness of the process at time $t$. Let $\{\hat{X}_t : t \ge 0\}$ be the stochastic process centered
about its mean so that the $i$th coordinate of $\hat{X}_t$ is

$$\hat{X}_t^i = X_t^i - \bar{X}_t.$$

It was shown in [7] that the centered process is ergodic. Let $\pi$ be the stationary distribution. If
we choose $\hat{X}_0$ according to the stationary distribution then the expected variance of the fitnesses,

$$E^\pi[c_2] \quad \text{where} \quad c_2 = \frac{1}{N}\sum_{i=1}^{N} (X_t^i - \bar{X}_t)^2,$$

does not change in time. Equation (12) of [7] states that

$$E^\pi[\bar{X}_t] = \left(\mu(2q - 1) + sE^\pi[c_2]\right)t$$

where $E^\pi$ means that the initial value of the process was chosen according to its stationary
distribution. This establishes a constant rate of increase of $E^\pi[\bar{X}_t]$ in time. Computing $E^\pi[c_2]$
would give the rate of adaptation. This sort of technique was used by Desai and Fisher [3] for a
related model. However, computing $E^\pi[c_2]$ is a difficult problem so we take another approach.

A heuristic argument in [7] showed the rate of adaptation should be $O(\log N/(\log\log N)^2)$.
They were able to establish a rigorous lower bound which showed $E[\bar{X}_t]$ increases at a rate of
$O(\log^{1-\delta} N)$ as $t$ tends to infinity for large enough values of $N$. In Kelly [4] it was shown that

$$\frac{E[\bar{X}_t]}{t} \le \frac{C\log N}{(\log\log N)^2}$$

for some constant $C > 0$.

Similar results have been conjectured for the rate of adaptation in the biological literature.
For examples, see Rouzine, Brunet and Coffin [5], Brunet, Rouzine and Wilke [2] and Rouzine,
Brunet and Wilke [6]. Technical, but not rigorous, arguments are given for the mean rate of
adaptation of similar models. The arguments rely on two assumptions. First, one assumes that
the bulk of the population behaves deterministically and the random noise is concentrated only
in the highest fitness classes. Second, one assumes that the bulk of the population is shaped like
a Gaussian curve. The one or two highest fitness classes behave as stochastic populations that
are expected to grow due to the selective advantage they have over the bulk of the population. The
rate of adaptation can be estimated by computing the mean time to the creation of a new highest
fitness class.


1.2 Main Result 

Fix a time $t > 0$. Let

$$T = \frac{16(\log\log N)^2}{\gamma\log N}$$

represent a time value and let $M$ be the integer such that

$$M \le \frac{t}{2T} < M + 1.$$

Define

$$X_s^+ = \max\{X_s^i : 1 \le i \le N\}$$

for all $s \ge 0$.
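
To get a feel for these scales, the time value $T = 16(\log\log N)^2/(\gamma\log N)$ and the integer $M$ with $M \le t/(2T) < M + 1$ can be evaluated numerically. This is a small sketch; the function name `time_scale` and the sample parameter values are hypothetical and serve only as an illustration.

```python
import math


def time_scale(N, gamma, t):
    """Return (T, M) with T = 16 (log log N)^2 / (gamma log N) and M = floor(t / (2T))."""
    log_n = math.log(N)
    T = 16 * math.log(log_n) ** 2 / (gamma * log_n)
    M = math.floor(t / (2 * T))
    return T, M
```

With the assumed values `N = 1e300`, `gamma = 1.0` and `t = 10.0`, the value of `T` is still close to 1, which illustrates how slowly $T$ tends to 0 as $N$ grows.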


Theorem 1. Let

$$\mathcal{E} = \{X_s^+ \ge [\ldots] \text{ for } s \in [T, t]\}.$$

Then

$$\lim_{N\to\infty} P(\mathcal{E}) = 1.$$


Establishing Theorem 1 shows that with probability tending to 1 the maximum fitness of the
population is at least $O(\log N/(\log\log N)^2)$. The result differs from the results in [7] and [4] in
that it is a convergence in probability rather than a convergence of the mean fitness. Also, the
result agrees with the heuristic argument given in [7].


2 Proof of the Theorem

For the remainder of the paper $t > 0$ is a fixed time value. Define the sequence $(s_i : i \ge 0)$ by

$$s_i = 2iT.$$

Let $M$ be the integer such that

$$s_M \le t < s_{M+1}.$$

Note that $M$ tends to infinity as $N$ does because $t$ is fixed and $T$ tends to 0 as $N$ tends to infinity.

At the times $s_i$ we wish to label an individual with maximum fitness. There may be multiple
individuals that have the maximum fitness. To make a choice that is precise, though arbitrary
with respect to the index, for $i \ge 0$ define

$$\alpha(i) = \min\{j : X_{s_i}^j = X_{s_i}^+\}.$$

When $j = \alpha(i)$ we say individual $j$ is labelled $\alpha$ over the time interval $[s_i, s_{i+1})$. Likewise,
we want to label an individual that is considered to be the second most fit individual in the
population. The second most fit individual may have the same fitness as the most fit individual.
For $i \ge 0$ define

$$\beta(i) = \min\{j \ne \alpha(i) : X_{s_i}^j \ge X_{s_i}^k \text{ for } k \in \{1, 2, \ldots, N\} \setminus \{\alpha(i)\}\}.$$

We define a Poisson point process $\mathcal{P}_\alpha$ that marks times at which individuals labelled $\alpha$ may
decrease in fitness. If individual $\alpha(i)$ gets a deleterious mutation or dies due to a resampling
event at time $s \in [s_i, s_{i+1})$ there is a point of $\mathcal{P}_\alpha$ at time $s$. Similarly, we define a Poisson point
process $\mathcal{P}_\beta$ for the individuals $\beta(i)$, $i \ge 0$. Both $\mathcal{P}_\alpha$ and $\mathcal{P}_\beta$ are Poisson point processes on $[0, \infty)$
of intensity $d$. The point processes take values on the measure space $(G, \mathcal{G})$ where

$$G = \{x \subset [0, \infty) : |x \cap B| < \infty \text{ for all } B \in \mathcal{B} \text{ such that } \lambda(B) < \infty\},$$

$\mathcal{B}$ is the Borel $\sigma$-field on $[0, \infty)$, $\lambda$ is the Lebesgue measure and

$$\mathcal{G} = \sigma(\{x \in G : |x \cap B| = m\} : B \in \mathcal{B},\ m \ge 0).$$

Let $\mathcal{P} = \mathcal{P}_\alpha \cup \mathcal{P}_\beta$. Since $\mathcal{P}_\alpha$ and $\mathcal{P}_\beta$ are independent, $\mathcal{P}$ is a Poisson point process of rate $2d$ that
marks times at which individuals labelled $\alpha$ or $\beta$ die. For $K \ge 0$ define

$$\Lambda_K = \{x \in G : |x \cap [0, s_{M+1}]| \le K,\ |x \cap [0, s_1]| = 0 \text{ and } |x \cap [s_i, s_{i+1})| \le 1 \text{ for } 1 \le i \le M\}.$$

Note that $\Lambda_K \in \mathcal{G}$ for $K \ge 0$.
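
The three conditions defining this set of point configurations can be checked mechanically for a finite list of points. The sketch below is only an illustration: the function name `in_lambda_K` and the representation of a configuration as a Python list of times are assumptions, and it uses $s_i = 2iT$ as defined at the start of this section.

```python
def in_lambda_K(points, K, T, M):
    """Check the three defining conditions for a finite list of point times."""

    def s(i):
        return 2 * i * T  # s_i = 2 i T

    pts = [p for p in points if 0 <= p <= s(M + 1)]
    # At most K points in [0, s_{M+1}].
    if len(pts) > K:
        return False
    # No point in [0, s_1].
    if any(p <= s(1) for p in pts):
        return False
    # At most one point in each interval [s_i, s_{i+1}) for 1 <= i <= M.
    for i in range(1, M + 1):
        if sum(1 for p in pts if s(i) <= p < s(i + 1)) > 1:
            return False
    return True
```

For instance, with the assumed values `T = 0.5` and `M = 3` (so that $s_i = i$), the configuration `[1.5, 2.5]` passes for `K = 2`, while `[1.2, 1.7]` fails because both points fall in $[s_1, s_2)$.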

Proposition 2. Let $\epsilon > 0$. There exists $K > 0$ such that $P(\mathcal{P} \in \Lambda_K) > 1 - \epsilon$ for large enough
values of $N$.

Proof. Let $\delta > 0$ such that $(1 - \delta)^2 > 1 - \epsilon$. For an interval $I$ we define $\mathcal{P}I$ to be the number of
points of $\mathcal{P}$ in the interval $I$. Then $\mathcal{P}[0, s_{M+1}]$ has the Poisson distribution with mean $2ds_{M+1}$.
Since

$$\lim_{N\to\infty} s_{M+1} = t,$$

for $N$ large enough $s_{M+1} < t + 1$. Because $t + 1$ is fixed, there exists $K$ large enough so that

$$\sum_{k=0}^{K} P(\mathcal{P}[0, t+1] = k) > 1 - \delta.$$

We expand over the conditional probability to get

$$P(\mathcal{P} \in \Lambda_K) \ge \sum_{k=0}^{K} P(\mathcal{P} \in \Lambda_K \mid \mathcal{P}[0, t+1] = k)\,P(\mathcal{P}[0, t+1] = k).$$

On the event $\{\mathcal{P}[0, t+1] = k\}$ the $k$ points $U_1, \ldots, U_k \in \mathcal{P}$ are i.i.d. and have the uniform
distribution on $[0, t+1]$. Therefore,

$$P(\mathcal{P} \in \Lambda_K \mid \mathcal{P}[0, t+1] = k) \ge \prod_{i=1}^{k} [\ldots].$$

Because $T$ tends to 0 as $N$ tends to infinity, $P(\mathcal{P} \in \Lambda_K \mid \mathcal{P}[0, t+1] = k)$ tends to 1 for any fixed
$k$. Since $K$ is fixed, for large enough values of $N$ we have

$$P(\mathcal{P} \in \Lambda_K \mid \mathcal{P}[0, t+1] = k) > 1 - \delta \quad \text{for } 0 \le k \le K.$$

By our choice of $K$, for $N$ large enough

$$\begin{aligned}
P(\mathcal{P} \in \Lambda_K) &\ge \sum_{k=0}^{K} P(\mathcal{P} \in \Lambda_K \mid \mathcal{P}[0, t+1] = k)\,P(\mathcal{P}[0, t+1] = k)\\
&> \sum_{k=0}^{K} (1 - \delta)\,P(\mathcal{P}[0, t+1] = k)\\
&> (1 - \delta)^2\\
&> 1 - \epsilon.
\end{aligned}$$

□

For $i \ge 0$ we define stopping times $\tau_\alpha(i)$, $\tau_\beta(i)$ and $\tau(i)$ so that we can couple the progeny of
individual $\alpha(i)$ with a branching process over the time interval $[s_i, (s_i + T) \wedge \tau_\alpha(i) \wedge \tau(i))$ and
$\beta(i)$ with a branching process over the time interval $[s_i, (s_i + T) \wedge \tau_\beta(i) \wedge \tau(i))$. Define

$$\tau_\alpha(i) = \inf\{s \ge s_i : s \in \mathcal{P}_\alpha\} \quad \text{and} \quad \tau_\beta(i) = \inf\{s \ge s_i : s \in \mathcal{P}_\beta\}.$$

Define a process $\{W_s : s \ge 0\}$ by $W_s = k$ if at least $N/2$ individuals have fitness in $[k, \infty)$ at time
$s$ but fewer than $N/2$ individuals have fitness in $(k, \infty)$ at time $s$. Let

$$\mathcal{W} = \frac{\log N}{8\log\log N}$$

and

$$\tau(i) = \inf\{s \ge s_i : X_{s_i}^{\beta(i)} - W_s < \mathcal{W}\}.$$

Let $\{Z_s : s \ge 0\}$ be a multi-type branching process that initially has one particle of type 0.
Particles have death rate $d$ and give birth to one offspring at rate

$$w = \frac{\gamma\mathcal{W}}{2}.$$

Particles change from type-0 to type-1 at rate $q\mu$. Let $\{Z^\alpha(i), Z^\beta(i) : i \ge 0\}$ be a collection
of i.i.d. branching processes each having the same distribution as $Z$. We couple $X$ with the
branching processes $Z^\alpha(i)$ over the time intervals $[s_i, (s_i + T) \wedge \tau_\alpha(i) \wedge \tau(i))$ as follows:


• At time $s_i$ the initial particle of $Z^\alpha(i)$ is coupled with individual $\alpha(i)$. Over the time interval
$[0, ((s_i + T) \wedge \tau_\alpha(i) \wedge \tau(i)) - s_i)$ each particle of $Z^\alpha(i)$ will be coupled with an individual in
$X$.

• Particles in $Z^\alpha(i)$ increase in type by 1 at time $s$ if and only if the corresponding individual in
$X$ gets a beneficial mutation at time $s_i + s$.

• The initial particle of $Z^\alpha(i)$ dies at rate $d$ independent of the process $X$. Any other particle
of $Z^\alpha(i)$ dies at time $s$ if and only if the individual it is coupled with gets a deleterious
mutation or dies due to a resampling event at time $s_i + s$.

• To explain the branching events of $Z^\alpha(i)$ we give another description of the selection
mechanism. For each individual $i$ in $X$ define a Poisson process $P^i$ on $[0, \infty) \times [0, \infty)$. The
collection of Poisson processes $(P^i : 1 \le i \le N)$ are independent of one another and each
have intensity $\lambda \times \lambda$ where $\lambda$ is Lebesgue measure. For a fixed individual $j$, define

$$\xi_s^j = \sum_{k=1}^{N} (X_s^j - X_s^k)^+.$$

Individual $j$ gives birth at time $s$ if there is a point of $P^j$ at $(s, x)$ for some $x \le \gamma\xi_s^j/N$. On the
event that individual $j$ gives birth individual $k$ is chosen to die with probability

$$\frac{(X_s^j - X_s^k)^+}{\xi_s^j}.$$

Define $\Theta_s^j$ to be the sum of terms in $\xi_s^j$ which are at least as large as $\mathcal{W}$:

$$\Theta_s^j = \sum_{k=1}^{N} (X_s^j - X_s^k)\,1_{\{X_s^j - X_s^k \ge \mathcal{W}\}}.$$

For $s_i \le s < \tau(i)$ we have $X_{s_i}^{\beta(i)} - W_s \ge \mathcal{W}$ so

$$\Theta_s^j \ge \frac{N\mathcal{W}}{2}$$

for each individual $j$ such that $X_s^j \ge X_{s_i}^{\beta(i)}$.

If a particle in $Z^\alpha(i)$ is coupled with individual $j$ then the fitness of individual $j$ is at least
$X_{s_i}^{\alpha(i)}$. The particle will branch at time $s$ with probability

$$\frac{N\mathcal{W}}{2\Theta_{s_i+s}^j}$$

if individual $j$ gives birth at time $s_i + s$ and the individual $k$ chosen to die satisfies
$X_{s_i+s}^j - X_{s_i+s}^k \ge \mathcal{W}$. The new particle is coupled with individual $k$.

Similarly, the progeny of individual $\beta(i)$ is coupled with the branching process $Z^\beta(i)$ over the
time intervals $[s_i, (s_i + T) \wedge \tau_\beta(i) \wedge \tau(i))$.

Let $\{\bar{Z}_s : s \ge 0\}$ be the number of particles in the process $Z$ of any type at time $s$. The
following propositions establish some results for the distribution of $\bar{Z}_s$.

Proposition 3. As $N$ tends to infinity

$$wP(\bar{Z}_T = 0) \to d \quad \text{and} \quad (\log N)P(\bar{Z}_T = 1) \to 1.$$

Proof. Let

$$f(s) = \frac{d(e^{(w-d)s} - 1)}{we^{(w-d)s} - d} \qquad (1)$$

$$g(s) = \frac{w(e^{(w-d)s} - 1)}{we^{(w-d)s} - d} = 1 - \frac{w - d}{we^{(w-d)s} - d}. \qquad (2)$$

We use the generating function of birth-death processes from page 109 of Athreya and Ney
[1] with birth rate $w$ and death rate $d$ to get

$$F(x, s) = \frac{d(x - 1) - (wx - d)e^{-(w-d)s}}{w(x - 1) - (wx - d)e^{-(w-d)s}}$$

where

$$F(x, s) = \sum_{k=0}^{\infty} P(\bar{Z}_s = k \mid \bar{Z}_0 = 1)x^k.$$

Solving the equation

$$\frac{1}{k!}\frac{\partial^k}{\partial x^k}F(x, s)\Big|_{x=0} = P(\bar{Z}_s = k \mid \bar{Z}_0 = 1)$$

we arrive at the formulas

$$P(\bar{Z}_s = 0 \mid \bar{Z}_0 = 1) = f(s) \quad \text{and} \quad P(\bar{Z}_s = i \mid \bar{Z}_0 = 1) = (1 - f(s))(1 - g(s))g(s)^{i-1} \text{ for } i \ge 1. \qquad (3)$$

The probability that the process dies by time $T$ is then

$$f(T) = \frac{d(e^{(w-d)T} - 1)}{we^{(w-d)T} - d}.$$

Because $wT = \log\log N$ tends to infinity with $N$ we have

$$\lim_{N\to\infty} wP(\bar{Z}_T = 0) = \lim_{N\to\infty} wf(T) = d.$$

The probability that there is exactly one individual at time $T$ is

$$(1 - f(T))(1 - g(T)) = (1 - f(T))\frac{w - d}{we^{(w-d)T} - d}.$$

By definition,

$$e^{wT} = \log N.$$

Therefore,

$$\lim_{N\to\infty} (\log N)P(\bar{Z}_T = 1) = \lim_{N\to\infty} (\log N)(1 - f(T))(1 - g(T)) = 1.$$

□
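
The two limits can be sanity-checked numerically from the closed forms for the extinction probability $f$ and the geometric ratio $g$ of a birth-death process with birth rate $w$ and death rate $d$. The sketch below is illustrative only: the function names and the choice to parametrise by `log_n` (the value of $\log N$, since $N$ itself would overflow a float) are assumptions, and the formulas require $w \ne d$.

```python
import math


def f(w, d, s):
    """Extinction probability by time s (birth rate w, death rate d, w != d)."""
    e = math.exp((w - d) * s)
    return d * (e - 1) / (w * e - d)


def g(w, d, s):
    """Geometric ratio of the law of the population size at time s."""
    e = math.exp((w - d) * s)
    return w * (e - 1) / (w * e - d)


def pmf(w, d, s, k):
    """P(population size at time s equals k), starting from one particle."""
    if k == 0:
        return f(w, d, s)
    return (1 - f(w, d, s)) * (1 - g(w, d, s)) * g(w, d, s) ** (k - 1)


def limits(log_n, gamma, d):
    """Return (w * P(size = 0 at T), log N * P(size = 1 at T)) for a given log N."""
    ll = math.log(log_n)                # log log N
    W = log_n / (8 * ll)                # the constant log N / (8 log log N)
    w = gamma * W / 2                   # birth rate
    T = 16 * ll ** 2 / (gamma * log_n)  # time scale, so that w * T = log log N
    return w * pmf(w, d, T, 0), log_n * pmf(w, d, T, 1)
```

For a large assumed value such as `log_n = 1e6` with `gamma = 1.0` and `d = 1.5`, the first returned value is close to `d` and the second is close to 1, in line with the statement of the proposition.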


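Proposition 4. As $N$ tends to infinity,

$$P(\bar{Z}_T > \mathcal{W}) \to 1.$$

Proof. By Equation (3) we have

$$\begin{aligned}
P(\bar{Z}_s \le \mathcal{W} \mid \bar{Z}_0 = 1) &= f(s) + (1 - f(s))(1 - g(s))\sum_{i=1}^{\mathcal{W}} g(s)^{i-1}\\
&= f(s) + (1 - f(s))(1 - g(s)^{\mathcal{W}}).
\end{aligned}$$

Therefore,

$$\begin{aligned}
P(\bar{Z}_T > \mathcal{W} \mid \bar{Z}_0 = 1) &= (1 - f(T))g(T)^{\mathcal{W}}\\
&= (1 - f(T))\left(1 - \frac{w - d}{we^{(w-d)T} - d}\right)^{\mathcal{W}}.
\end{aligned}$$

As $N$ tends to infinity $1 - f(T)$ tends to 1. For $g(T)^{\mathcal{W}}$ to tend to 1 it is enough to have

$$\lim_{N\to\infty} \frac{(w - d)\mathcal{W}}{we^{(w-d)T} - d} = 0.$$

Because

$$\lim_{N\to\infty} \frac{\mathcal{W}}{e^{wT}} = \lim_{N\to\infty} \frac{1}{8\log\log N} = 0$$

we have

$$\lim_{N\to\infty} \frac{(w - d)\mathcal{W}}{we^{(w-d)T} - d} = 0.$$
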
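□

Lemma 5. Let $\tau$ be a stopping time and $\Gamma \subset \{1, 2, \ldots, N\}$ such that $|\Gamma| \ge N/2$. Let $A_i$ be the
event that individual $i$ gets at least $2\mathcal{W}$ beneficial mutation events over the time interval $[\tau, \tau + T]$.
Then

$$\lim_{N\to\infty} P\Big(\sum_{i\in\Gamma} 1_{A_i} \ge 2\Big) = 1.$$

Proof. First divide $\Gamma$ into two disjoint sets, $\Gamma_1$ and $\Gamma_2$, each of size at least $N/4$. Because $X$ is a
strong Markov process and $\tau$ is a stopping time the mutation events of $X$ that occur in the time
interval $[\tau, \tau + T]$ are independent of the events which occur outside of that interval and have
the same distribution as those that occur over the interval $[0, T]$.

Using the identity

$$P\Big(\sum_{i\in\Gamma} 1_{A_i} \ge 2\Big) = P\Big(\sum_{i\in\Gamma_1} 1_{A_i} + \sum_{i\in\Gamma_2} 1_{A_i} \ge 2\Big)$$

it is enough to show

$$\lim_{N\to\infty} P\Big(\sum_{i\in\Gamma_1} 1_{A_i} \ge 1\Big) = 1.$$

Beneficial mutations occur on an individual at rate $q\mu$, so the probability that an individual
gets $k$ or more beneficial mutations in the time interval $[\tau, \tau + T]$ is

$$\sum_{i=k}^{\infty} \frac{(q\mu T)^i e^{-q\mu T}}{i!}.$$

Because the beneficial mutation events of one individual occur independently of other
individuals,

$$\begin{aligned}
P\Big(\bigcup_{i\in\Gamma_1} A_i\Big) &\ge 1 - \Big(1 - \sum_{i=\lceil 2\mathcal{W}\rceil}^{\infty} \frac{(q\mu T)^i e^{-q\mu T}}{i!}\Big)^{N/4}\\
&\ge 1 - \Big(1 - \frac{(q\mu T)^{\lceil 2\mathcal{W}\rceil} e^{-q\mu T}}{\lceil 2\mathcal{W}\rceil !}\Big)^{N/4}.
\end{aligned}$$

It is enough to have

$$\lim_{N\to\infty} \frac{N(q\mu T)^{\lceil 2\mathcal{W}\rceil} e^{-q\mu T}}{4\lceil 2\mathcal{W}\rceil !} = \infty.$$

Applying Stirling's formula and the identity

$$T = \frac{2\log\log N}{\gamma\mathcal{W}}$$

it is enough to show

$$\frac{N(2q\mu\log\log N)^{\lceil 2\mathcal{W}\rceil} e^{-q\mu T - \lceil 2\mathcal{W}\rceil}}{4(\gamma\mathcal{W})^{\lceil 2\mathcal{W}\rceil}(2\pi\lceil 2\mathcal{W}\rceil)^{1/2}\lceil 2\mathcal{W}\rceil^{\lceil 2\mathcal{W}\rceil}}$$

tends to infinity as $N$ does. By dropping the ceiling function in the denominator we get the lower
bound

$$\frac{N(2q\mu\log\log N)^{\lceil 2\mathcal{W}\rceil} e^{-q\mu T - \lceil 2\mathcal{W}\rceil}}{4(\gamma\mathcal{W})^{\lceil 2\mathcal{W}\rceil}(2\pi\lceil 2\mathcal{W}\rceil)^{1/2}\lceil 2\mathcal{W}\rceil^{\lceil 2\mathcal{W}\rceil}} \ge \frac{N(2q\mu\log\log N)^{\lceil 2\mathcal{W}\rceil} e^{-q\mu T - \lceil 2\mathcal{W}\rceil}}{4(2\gamma)^{2\mathcal{W}}(4\pi\mathcal{W})^{1/2}\mathcal{W}^{4\mathcal{W}}}.$$

By definition of $\mathcal{W}$ we have

$$\mathcal{W}^{4\mathcal{W}} = N^{1/2 - \log(8\log\log N)/(2\log\log N)}$$

so

$$\frac{N(2q\mu\log\log N)^{\lceil 2\mathcal{W}\rceil} e^{-q\mu T - \lceil 2\mathcal{W}\rceil}}{4(2\gamma)^{2\mathcal{W}}(4\pi\mathcal{W})^{1/2}\mathcal{W}^{4\mathcal{W}}} = \frac{N^{1/2 + \log(8\log\log N)/(2\log\log N)}(2q\mu\log\log N)^{\lceil 2\mathcal{W}\rceil} e^{-q\mu T - \lceil 2\mathcal{W}\rceil}}{4(2\gamma)^{2\mathcal{W}}(4\pi\mathcal{W})^{1/2}}$$

which tends to infinity as $N$ tends to infinity. □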
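
The power identity used in this proof, namely that the constant $\log N/(8\log\log N)$ raised to four times itself equals $N^{1/2 - \log(8\log\log N)/(2\log\log N)}$, is easy to confirm numerically by comparing logarithms. This is a sketch under the assumption that the constant is exactly $\log N/(8\log\log N)$; the function name `log_power_identity` is hypothetical.

```python
import math


def log_power_identity(log_n):
    """Return the logarithms of both sides of the identity for a given log N."""
    ll = math.log(log_n)                               # log log N
    W = log_n / (8 * ll)                               # log N / (8 log log N)
    lhs = 4 * W * math.log(W)                          # log of W^(4W)
    rhs = (0.5 - math.log(8 * ll) / (2 * ll)) * log_n  # log of N^(1/2 - ...)
    return lhs, rhs
```

Both returned values agree up to floating-point error for any `log_n` greater than 1, since $\log\mathcal{W} = \log\log N - \log(8\log\log N)$.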

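Lemma 6. Let $\Gamma \subset \{1, 2, \ldots, N\}$ such that $|\Gamma| \ge \mathcal{W}$. Let $s$ be a fixed time and let

$$A_i = \{X_{s+T}^i - X_s^i \ge 1\}.$$

Then

$$\lim_{N\to\infty} P\Big(\sum_{i\in\Gamma} 1_{A_i} \ge 2\Big) = 1.$$

Proof. Because $|\Gamma| \ge \mathcal{W}$ we can partition $\Gamma$ as $\Gamma = \Gamma_1 \cup \Gamma_2$ where $|\Gamma_1| \ge \mathcal{W}/2$ and $|\Gamma_2| \ge \mathcal{W}/2$.
Using the identity

$$P\Big(\sum_{i\in\Gamma} 1_{A_i} \ge 2\Big) = P\Big(\sum_{i\in\Gamma_1} 1_{A_i} + \sum_{i\in\Gamma_2} 1_{A_i} \ge 2\Big)$$

it is enough to show

$$\lim_{N\to\infty} P\Big(\sum_{i\in\Gamma_1} 1_{A_i} \ge 1\Big) = 1.$$

Because $X$ is a Markov process the rates of birth, death and mutation are the same as if the
process were started at time 0. Individuals get beneficial mutations at rate $q\mu$. Events on which
individuals may decrease in fitness occur at rate $d$. Because the event $A_i$ occurs if individual $i$
gets at least one beneficial mutation and has no deleterious mutations or deaths due to resampling
events,

$$P(A_i) \ge (1 - e^{-q\mu T})e^{-dT}$$

for all $i$. Mutation events and deaths due to resampling occur on individuals independently.
Therefore,

$$P\Big(\sum_{i\in\Gamma_1} 1_{A_i} \ge 1\Big) \ge 1 - \left(1 - (1 - e^{-q\mu T})e^{-dT}\right)^{\mathcal{W}/2}.$$

To finish the proof it is enough to have

$$\lim_{N\to\infty} \frac{\mathcal{W}}{2}(1 - e^{-q\mu T})e^{-dT} = \infty.$$

Using the bound $1 - e^{-x} \ge x - x^2/2$ we have

$$\frac{\mathcal{W}}{2}(1 - e^{-q\mu T}) \ge \frac{2\mathcal{W}q\mu T - \mathcal{W}(q\mu T)^2}{4}.$$

Since

$$\mathcal{W}T = \frac{2\log\log N}{\gamma}$$

and $T$ tends to 0, we have

$$\lim_{N\to\infty} \frac{\mathcal{W}}{2}(1 - e^{-q\mu T})e^{-dT} = \infty.$$

□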


Proposition 7. Let $\epsilon > 0$. For any $K \ge 0$ there exists $N_0$ such that for $N \ge N_0$ we have

$$P(\mathcal{E} \mid \mathcal{P} = x) > 1 - \epsilon$$

for all $x \in \Lambda_K$.

Proof. Fix an integer $0 \le i \le M$. Let $\sigma_\alpha(i) = (s_i + T) \wedge \tau_\alpha(i) \wedge \tau(i)$ and let
$\sigma_\beta(i) = (s_i + T) \wedge \tau_\beta(i) \wedge \tau(i)$. By definition of $\Lambda_K$, if $x \in \Lambda_K$ then either $\sigma_\alpha(i) \ne \tau_\alpha(i)$ or
$\sigma_\beta(i) \ne \tau_\beta(i)$. Therefore, at least one of the following must occur:

1. $\tau(i) < s_i + T$,

2. $\sigma_\alpha(i) = s_i + T$ or

3. $\sigma_\beta(i) = s_i + T$.

First suppose $\tau(i) < s_i + T$. Then, for $x \in \Lambda_K$, at least one of $\sigma_\alpha(i) = \tau(i)$ or $\sigma_\beta(i) = \tau(i)$ holds.
Without loss of generality suppose $\sigma_\alpha(i) = \tau(i)$. At time $\tau(i)$ there are at least $N/2$ individuals
with fitness $X_{s_i}^{\beta(i)} - \mathcal{W}$ or higher. Let $\Gamma$ be the set of all individuals that at time $\tau(i)$ have fitness
$X_{s_i}^{\beta(i)} - \mathcal{W}$ or higher. By Lemma 5, with probability tending to 1 there will be two individuals of $\Gamma$
that receive $2\mathcal{W}$ beneficial mutations by time $\tau(i) + T$. Both of these individuals have fitness greater
than $X_{s_i}^{\beta(i)}$ at time $\tau(i) + T$. The probability that either of these individuals gets a deleterious
mutation or decreases due to a resampling event over the time interval $[s_i, s_{i+1}]$ tends to 0 as $N$
tends to infinity, because such events occur at rate $d$ for each individual and the interval has
length $2T$, which tends to 0. Therefore, with probability $p_1$ which tends to 1 as $N$ tends to
infinity we have $X_{s_{i+1}}^{\beta(i+1)} > X_{s_i}^{\beta(i)}$.

If $\tau(i) \ge s_i + T$ then, because $x \in \Lambda_K$, either $\sigma_\alpha(i) = s_i + T$ or $\sigma_\beta(i) = s_i + T$. Without
loss of generality, suppose $\sigma_\alpha(i) = s_i + T$. The coupling of $X$ with $Z^\alpha(i)$ holds for the time interval
$[s_i, s_i + T]$ so by Proposition 4 with probability tending to 1 there will be at least $\mathcal{W}$ individuals
with fitness in $[X_{s_i}^{\beta(i)}, \infty)$ at time $s_i + T$. By Lemma 6 at least two of those individuals will
increase by at least 1 by time $s_{i+1}$. Since the probability of either of those individuals decreasing
in fitness over the time interval $[s_i + T, s_{i+1}]$ tends to 0 as $N$ tends to infinity,
with probability $p_2$ tending to 1 we have $X_{s_{i+1}}^{\beta(i+1)} > X_{s_i}^{\beta(i)}$.

Let $p = p_1 \wedge p_2$. Because there is at most one point of $x$ over any time interval $[s_i, s_{i+1})$ we have
$X_s^+ \ge X_{s_i}^{\beta(i)}$ for all $s \in [s_i, s_{i+1})$. This is where using the second highest fitness is important
as we ensure that $X_s^+$ cannot decrease too far even when the maximum fitness class is lost.
Since $x \in \Lambda_K$ we have $|x \cap [0, s_{M+1}]| = k \le K$. The probability that $X_{s_{i+1}}^{\beta(i+1)} > X_{s_i}^{\beta(i)}$ on
the $k$ intervals in which there is a point of $x$ is at least $p^k$. On an interval over which there
is no point of $x$, we have $X_{s_{i+1}}^{\beta(i+1)} \ge X_{s_i}^{\beta(i)}$. Therefore, it is allowable for the fitness to fail to
increase over some of the intervals on which there is no point of $x$. If $X_{s_{i+1}}^{\beta(i+1)} > X_{s_i}^{\beta(i)}$ for
at least half of the values $i$ such that $|x \cap [s_i, s_{i+1})| = 0$ then $X^+$ must have increased by at
least $(M - k)/2 + k = (M + k)/2 \ge M/2$ for large enough values of $N$. Because the events
$\{X_{s_{i+1}}^{\beta(i+1)} > X_{s_i}^{\beta(i)}\}$ are independent and the probability they occur is tending to 1, we can apply
the Chernoff bound to get

$$P(\mathcal{E} \mid \mathcal{P} = x) \ge p^k\left(1 - e^{-((M-k)/(2p))(p - 1/2)^2}\right).$$

As $N$ tends to infinity, $M$ tends to infinity and $p$ tends to 1, yielding the result. □
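
The behaviour of the Chernoff-type lower bound $p^k(1 - e^{-((M-k)/(2p))(p-1/2)^2})$ as $M$ grows and $p$ approaches 1 can be tabulated with a few lines of code. This is a small illustrative sketch; the function name `conditional_lower_bound` and the sample values of `p`, `k` and `M` are assumptions.

```python
import math


def conditional_lower_bound(p, k, M):
    """Evaluate p^k * (1 - exp(-((M - k) / (2 p)) * (p - 1/2)^2))."""
    return p ** k * (1 - math.exp(-((M - k) / (2 * p)) * (p - 0.5) ** 2))
```

For fixed `k`, the value increases towards `p ** k` as `M` grows, and `p ** k` itself tends to 1 as `p` tends to 1, which is the mechanism behind the final sentence of the proof.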

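Proof of Theorem 1. Let $\epsilon > 0$ and let $\delta > 0$ such that $(1 - \delta)^2 > 1 - \epsilon$. By Proposition 2 there
exists $K$ such that for $N$ large enough we have

$$P(\mathcal{P} \in \Lambda_K) > 1 - \delta. \qquad (4)$$

We use the bound

$$P(\mathcal{E}) \ge P(\{\mathcal{P} \in \Lambda_K\} \cap \mathcal{E}) = E[1_{\{\mathcal{P} \in \Lambda_K\}} 1_{\mathcal{E}}].$$

The set $\Lambda_K \in \mathcal{G}$ so $\{\mathcal{P} \in \Lambda_K\} \in \sigma(\mathcal{P})$. By definition of conditional expectation we have

$$E[1_{\{\mathcal{P} \in \Lambda_K\}} 1_{\mathcal{E}}] = E[1_{\{\mathcal{P} \in \Lambda_K\}} P(\mathcal{E} \mid \mathcal{P})].$$

By definition of expectation we can expand the expression as

$$E[1_{\{\mathcal{P} \in \Lambda_K\}} P(\mathcal{E} \mid \mathcal{P})] = \int_\Omega 1_{\Lambda_K}(\mathcal{P}(\omega))\,P(\mathcal{E} \mid \mathcal{P} = \mathcal{P}(\omega))\,dP(\omega).$$

For $H \in \mathcal{G}$ let

$$\mu_{\mathcal{P}}(H) = P(\mathcal{P} \in H).$$

Then we can push forward the integral to obtain

$$\begin{aligned}
\int_\Omega 1_{\Lambda_K}(\mathcal{P}(\omega))\,P(\mathcal{E} \mid \mathcal{P} = \mathcal{P}(\omega))\,dP(\omega) &= \int_G 1_{\Lambda_K}(x)\,P(\mathcal{E} \mid \mathcal{P} = x)\,d\mu_{\mathcal{P}}(x)\\
&= \int_{\Lambda_K} P(\mathcal{E} \mid \mathcal{P} = x)\,d\mu_{\mathcal{P}}(x).
\end{aligned}$$

By Proposition 7 there exists $N_0$ such that if $N \ge N_0$ then $P(\mathcal{E} \mid \mathcal{P} = x) > 1 - \delta$ for all $x \in \Lambda_K$.
Hence

$$\begin{aligned}
\int_{\Lambda_K} P(\mathcal{E} \mid \mathcal{P} = x)\,d\mu_{\mathcal{P}}(x) &\ge \int_{\Lambda_K} (1 - \delta)\,d\mu_{\mathcal{P}}(x)\\
&= (1 - \delta)\mu_{\mathcal{P}}(\Lambda_K)\\
&= (1 - \delta)P(\mathcal{P} \in \Lambda_K)\\
&> (1 - \delta)^2 \quad \text{by Equation (4)}\\
&> 1 - \epsilon.
\end{aligned}$$

Therefore, for any $\epsilon > 0$ there exists $N_0$ such that if $N \ge N_0$ then $P(\mathcal{E}) > 1 - \epsilon$, which
completes the proof. □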